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SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Hillman, Jennifer L. 

6 Goli, Surya K. 

8 (ii) TITLE OF INVENTION: NOVEL HUMAN MLS 3 PROTEIN 

11 (iii) NUMBER OF SEQUENCES: 6 

13 (iv) CORRESPONDENCE ADDRESS: 

14 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

15 (B) STREET: 3174 Porter Drive 

16 (C) CITY: Palo Alto 

17 (D) STATE: CA 

18 (E) COUNTRY: USA 

19 (F) ZIP: 94304 

21 (v) COMPUTER READABLE FORM: 

22 (A) MEDIUM TYPE: Diskette 
2 3 (B) COMPUTER: IBM Compatible 

24 (C) OPERATING SYSTEM: DOS 

25 (D) SOFTWARE: FastSEQ for Windows Version 2, 
27 (vi) CURRENT APPLICATION DATA: 

C--> 28 (A) APPLICATION NUMBER: US/10/02 3 , 929 

C--> 29 (B) FILING DATE: 17-Dec-2001 

35 (C) CLASSIFICATION: 

37 (vii) PRIOR APPLICATION DATA: 

33 (A) APPLICATION NUMBER: US/09/205,44 9 

34 (B) FILING DATE: 1998-12-04 

38 (A) APPLICATION NUMBER: 08/805,965 

39 (B) FILING DATE: 1997-02-25 

41 (viii) ATTORNEY/AGENT INFORMATION: 

42 (A) NAME: Billings, Lucy J. 
4 3 (B) REGISTRATION NUMBER: 36,749 
4 4 (C) REFERENCE/DOCKET NUMBER: PF-0223 US 
4 6 (ix) TELECOMMUNICATION INFORMATION: 

47 (A) TELEPHONE: 415-855-0555 

48 (B) TELEFAX: 415-845-4166 

49 (C) TELEX: 

52 (2) INFORMATION FOR SEQ ID NO: 1: 

54 (i) SEQUENCE CHARACTERISTICS: 

55 (A) LENGTH: 262 amino acids 

56 (B) TYPE: amino acid 

57 (C) STRANDEDNESS : single 

58 (D) TOPOLOGY: linear 

60 (vii) IMMEDIATE SOURCE: 

61 (A) LIBRARY: BRAITUT02 
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62 (B) CLONE: 762280 

64 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

66 Met Phe Arg Phe Met Arg Asp Val Glu Pro Glu Asp Pro Met Phe Leu 

67 1 5 10 ' 15 

68 Met Asp Pro Phe Ala He His Arg Gin His Met Ser Arg Met Leu Ser 

69 20 25 30 

70 Gly Gly Phe Gly Tyr Ser Pro Phe Leu Ser He Thr Asp Gly Asn Met 

71 35 40 4 5 

72 Pro Gly Thr Arg Ala Ala Ser Arg Arg Met Gin Gin Ala Gly Ala Val 

73 50 55 60 

W--> 74 Xaa Pro Phe Gly Xaa Leu Gly Met Ser Gly Gly Phe Met Asp Met Phe 

75 65 70 75 80 

W--> 76 Gly Met Met Asn Asp Met Xaa Gly Asn Met Glu His Met Thr Ala Gly 

77 85 90 95 

78 Gly Asn Cys Gin Thr Phe Ser Ser Ser Thr Val He Ser Tyr Ser Asn 

79 100 105 110 

80 Thr Gly Asp Gly Ala Pro Lys Val Tyr Gin Glu Thr Ser Glu Met Arg 

81 115 120 125 

82 Ser Ala Pro Gly Gly He Arg Glu Thr Arg Arg Thr Val Arg Asp Ser 

83 130 135 140 

84 Asp Ser Gly Leu Glu Gin Met Ser He Gly His His He Arg Asp Arg 

85 145 150 155 160 

86 Ala His He Leu Gin Arg Ser Arg Asn His Arg Thr Gly Asp Gin Glu 

87 165 170 ~ " 175 

88 Glu Arg Gin Asp Tyr He Asn Leu Asp Glu Ser Glu Ala Ala Ala Phe 

89 180 185 190 



Asp Asp Glu Trp Arg Arg Glu Thr Ser Arg Phe Arg Gin Gin Arg Pro 



90 

91 195 200 205 

92 Leu Glu Phe Arg Arg Leu Glu Ser Ser Gly Ala Gly Gly Arg Arg Ala 

93 210 215 220 

94 Glu Gly Pro Pro Arg Leu Ala He Gin Gly Pro Glu Asp Ser Leu Pro 

95 225 230 235 240 

96 Asp Ser Pro Ala Ala Met Thr Gly Glu Gly Pro Gly Ala Ser Ala Leu 

97 245 250 ~ 255 

98 Leu Tyr Arg Leu Arg Gly 

99 260 

101 (2) INFORMATION FOR SEQ ID NO: 2: 

103 (i) SEQUENCE CHARACTERISTICS: 

104 (A) LENGTH: 1322 base pairs 

105 (B) TYPE: nucleic acid 

106 (C) STRANDEDNESS : single 

107 (D) TOPOLOGY: linear 

109 (vii) IMMEDIATE SOURCE: 

110 (A) LIBRARY: BRAITUT02 
HI (B) CLONE: 762280 

113 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

115 GGGGGGCGTA CGGAGGTGGC AGCTGTGGGA GGAGGCGGCG TGGAAGGCCG AGGAGCTCAA 6 0 

116 GCCCGGACCA ATCCCCACGT TCCGGGCCGC CACCCTGACC CTGCAGCGTA CCGGGAAGCG 12 0 

117 AAACCGGCCG GATGGGCCGC TGAGCCCGAA TCGGGCACTG TGTGGAGCCC CCTGGAGCTG 180 
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118 AGATCAGGAT GTTCCGCTTC ATGAGGGACG TGGAGCCTGA GGATCCCATG TTCCTGATGG 24 0 

119 ATCCCTTTGC TATTCACCGT CAGCATATGA GCCGTATGTT GTCAGGTGGC TTTGGATATA 300 

120 GCCCCTTCCT CAGCATCACA GATGGCAACA TGCCAGGGAC CAGGGCTGCC AGCCGCCGGA 360 

121 TGCAGCAGGC TGGAGCTGTC TNCCCCTTTG GGNTGCTGGG AATGTCGGGT GGTTTCATGG 420 

122 ACATGTTTGG GATGATGAAT GACATGNTTG GAAACATGGA ACACATGACA GCTGGAGGCA 4 80 
12 3 ATTGCCAGAC CTTCTCATCT TCCACTGTCA TCTCCTACTC CAATACGGGT GATGGTGCCC 54 0 

124 CCAAGGTCTA CCAAGAGACA TCAGAGATGC GCTCGGCACC AGGCGGGATC CGGGAGACAC 600 

125 GGAGGACTGT TCGGGATTCA GACAGTGGAC TGGAGCAGAT GTCCATTGGG CATCACATCC 6 60 

126 GGGACAGGGC TCACATCCTC CAGCGCTCCC GAAACCATCG CACGGGGGAC CAGGAGGAGC 72 0 

127 GGCAGGACTA TATCAACCTG GATGAGAGTG AGGCCGCAGC GTTTGATGAC GAGTGGCGGC 78 0 
12 8 GGGAGACCTC CCGATTCCGG CAGCAGCGTC CCCTGGAGTT TCGGCGGCTT GAGTCCTCAG 84 0 
12 9 GGGCTGGGGG ACGAAGGGCG GAGGGGCCTC CCCGCCTGGC CATCCAGGGA CCTGAGGACT 900 

130 CCCTTCCCGA CAGTCCCGCC GCTATGACTG GTGAGGGCCC CGGGGCCTCA GCTCTCTTGT 96 0 

131 ACAGGCTGAG AGGCTGAGAA ATCATCCCCT GAATAACTTT TTCCTCTCGA TTCCCATCCC 102 0 

132 CAATTTAATA TTAAATTAAC AGGCAAGCCG GCCCCCACCT CTCCCTGGGG GTCTCAGGGA 108 0 

133 GAACCTTTCA CGGCACCCTT TCCCTACCTT TTCCTTCTTT AATCTCCTGG TTTACCATTG 114 0 

134 ATGACTTCGG CTCTGCATCT ACTTACTTGA TTTTTCATTC TGCCACTTCA TCTTCAAACC 1200 

135 CCCTCACCTT TCCCATCCTA CTCCTGCCAT GCATTGAAGG GTCAATGCAT TTTGGGGTGA 12 6 0 

136 GNTTNGGTTT AGGGGCCCCT TCATNCCTNA GCTACCTGGG TCTTTGCCCA ACTTTTCTCA 132 0 

137 GA 1322 
139 (2) INFORMATION FOR SEQ ID NO : 3: 

141 (i) SEQUENCE CHARACTERISTICS: 

142 (A) LENGTH: 268 amino acids 

143 (B) TYPE: amino acid 

144 (C) STRANDEDNESS : single 

145 (D) TOPOLOGY: linear 
14 7 (vii) IMMEDIATE SOURCE: 

148 (A) LIBRARY: GenBank 

149 (B) CLONE: 1066392 

151 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

153 Met Phe Arg Met Leu Asn Ser Ser Phe Glu Asp Asp Pro Phe Phe Ser 

154 1 5 10 15 

155 Glu Ser lie Leu Ala His Arg Glu Asn Met Arg Gin Met lie Arg Ser 

156 20 25 30 

157 Phe Ser Glu Pro Phe Gly Arg Asp Leu Leu Ser lie Ser Asp Gly Arg 

158 3 5 4 0 4 5 

159 Gly Arg Ala His Asn Arg Arg Gly His Asn Asp Gly Glu Asp Ser Leu 

160 50 55 60 

161 Thr His Thr Asp Val Ser Ser Phe Gin Thr Met Asp Gin Met Val Ser 

162 65 70 75 80 

163 Asn Met Arg Asn Tyr Met Gin Lys Leu Glu Arg Asn Phe Gly Gin Leu 

164 85 90 ~ 95 

165 Ser Val Asp Pro Asn Gly His Ser Phe Cys Ser Ser Ser Val Met Thr 

166 100 105 110 

167 Tyr Ser Lys He Gly Asp Glu Pro Pro Lys Val Phe Gin Ala Ser Thr 

168 115 120 125 

169 Gin Thr Arg Arg Ala Pro Gly Gly He Lys Glu Thr Arg Lys Ala Met 

170 130 135 140 

171 Arg Asp Ser Asp Ser Gly Leu Glu Lys Met Ala He Gly His His He 
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172 145 150 155 160 

173 His Asp Arg Ala His Val lie Lys Lys Ser Lys Asn Lys Lys Thr Gly 

174 165 170 175 

175 Asp Glu Glu Val Asn Gin Glu Phe lie Asn Met Asn Glu Ser Asp Ala 

176 180 185 190 

177 His Ala Phe Asp Glu Glu Trp Gin Ser Glu Val Leu Lys Tyr Lys Pro 

178 195 200 205 

179 Gly Arg His Asn Leu Gly Asn Thr Arg Met Arg Ser Val Gly His Glu 

180 210 215 220 

181 Asn Pro Gly Ser Arg Glu Leu Lys Arg Arg Glu Lys Pro Gin Gin Ser 

182 225 230 235 240 

183 Pro Ala lie Glu His Gly Arg Arg Ser Asn Val Leu Gly Asp Lys Leu 

184 245 250 255 

185 His lie Lys Gly Ser Ser Val Lys Ser Asn Lys Lys 

186 260 265 
188 (2) INFORMATION FOR SEQ ID NO : 4: 

190 (i) SEQUENCE CHARACTERISTICS: 

191 (A) LENGTH: 1116 base pairs 

192 (B) TYPE: nucleic acid 

193 (C) STRANDEDNESS: single 

194 (D) TOPOLOGY: linear 

196 (vii) IMMEDIATE SOURCE: 

197 (A) LIBRARY: GenBank 

198 (B) CLONE: 1066391 

200 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

2 02 GTTATGTGTT CCCGTCCGTA CTGGAGGCTA GCTCTTGTCG CGGCCGCGGC GAGTTAACAT 6 0 

203 CGTTTTTCCA ATCTGTCCGC GGCTGCCGCC ACCCAAGACA GAGCCAGAAT GTTCAGGATG 12 0 

2 04 CTGAACAGCA GTTTTGAGGA TGACCCCTTC TTCTCTGAGT CCATTCTTGC ACACCGAGAA 180 

205 AATATGCGAC AGATGATAAG AAGTTTTTCT GAACCCTTTG GAAGAGACTT GCTCAGTATC 24 0 

2 06 TCTGATGGTA GAGGGAGAGC TCATAATCGT AGAGGACATA ATGATGGTGA AGATTCTTTG 30 0 

207 ACTCATACAG ATGTCAGCTC TTTCCAGACC ATGGACCAAA TGGTGTCAAA TATGAGAAAC 36 0 

2 08 TATATGCAGA AATTAGAAAG AAACTTCGGT CAACTTTCAG TGGATCCAAA TGGACATTCA 42 0 

2 09 TTTTGTTCTT CCTCAGTTAT GACTTATTCC AAAATAGGAG ATGAACCGCC AAAGGTTTTT 4 80 

210 CAGGCCTCAA CTCAAACTCG TCGAGCTCCA GGAGGAATAA AGGAAACCAG GAAAGCAATG 54 0 

211 AGAGATTCTG ACAGTGGACT AGAAAAAATG GCTATTGGTC ATCATATCCA TGACCGAGCT 6 00 

212 CATGTCATTA A A A AG T C A A A GAACAAGAAG ACTGGAGATG AAGAGGTCAA CCAGGAGTTC 6 60 

213 ATCAATATGA ATGAAAGCGA TGCTCATGCT TTTGATGAGG AGTGGCAAAG TGAGGTTTTG 72 0 

214 AAGTACAAAC CAGGACGACA CAATCTAGGA AACACTAGAA TGAGAAGTGT TGGCCATGAG 7 80 

215 AATCCTGGCT CCCGAGAACT TAAAAGAAGG GAGAAACCTC AACAAAGTCC AGCCATTGAA 84 0 

216 CATGGAAGGA GATCAAATGT TTTGGGGGAC AAACTCCACA TCAAAGGCTC ATCTGTGAAA 9 00 

217 AGCAACAAAA AATAAATAGC CATGCATTTG ATTTGTTTAG TTTTGATTGT TTTAACAGTT 96 0 

218 AGTAATGGTG CTGGGTAATA AGCATAAGAC CAATCTCTTG CTGTTAAATC AGTTCTGTCC 102 0 

219 TTGGCAACTT TCTTCTGATA TCTGAATGT.T CATGAAGGTC CTAGCTTTAT ATTGTCCCTC 1080 
2 20 TTTTAGGAAT AAAATTTTGA TTTTCAACAA AAAAAA 1116 
222 (2) INFORMATION FOR SEQ ID NO : 5: 

224 (i) SEQUENCE CHARACTERISTICS: 

225 (A) LENGTH: 248 amino acids 

226 (B) TYPE: amino acid 

227 (C) STRANDEDNESS: single 
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228 (D) TOPOLOGY: linear 

230 (vii) IMMEDIATE SOURCE: 

231 (A) LIBRARY: GenBank 

232 (B) CLONE: 1399745 

234 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 
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26 9 (2) INFORMATION FOR SEQ ID NO : 6: 

2 71 (i) SEQUENCE CHARACTERISTICS: 

272 (A) LENGTH: 1502 base pairs 

273 (B) TYPE: nucleic acid 

274 (C) STRANDEDNESS : single 

275 (D) TOPOLOGY: linear 

277 (vii) IMMEDIATE SOURCE: 

278 (A) LIBRARY: GenBank 

279 (B) CLONE: 1399744 

281 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

283 CTCTAAAGGG CAGCTGTGGG AGGAGGCGGC GTGGAAGGCC GAGGAGCTCA AGCCCGGACC 6 0 

2 84 AATCCCCACG TTCCGGGCCG CGACCCTGAC CCTGCAGCGT ACCGGGAAGC GAAACCGGCC 12 0 
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VERIFICATION SUMMARY DATE: 02/11/2 002 

PATENT APPLICATION: US/10/023,929 TIME: 08:42:55 

Input Set : N:\Crf3\RULE60\10023929.raw 
Output Set: N:\CRF3\02112002\J023929.raw 

L:28 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 

L:29 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:74 M:341 W: (46) " n " or "Xaa" used, for SEQ ID# : 1 

L:76 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 
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